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ANALYSIS OF TOP TO BOTTOM-K SHUFFLES 

By Sharad Goel 1 

Cornell University 

A deck of n cards is shuffled by repeatedly moving the top card to 
one of the bottom k n positions uniformly at random. We give upper 
and lower bounds on the total variation mixing time for this shuffle as 
k n ranges from a constant to n. We also consider a symmetric variant 
of this shuffle in which at each step either the top card is randomly 
inserted into the bottom k n positions or a random card from the 
bottom k„ positions is moved to the top. For this reversible shuffle 
we derive bounds on the L 2 mixing time. Finally, we transfer mixing 
time estimates for the above shuffles to the lazy top to bottom-fc 
walks that move with probability 1/2 at each step. 

1. Introduction. A deck of n cards can be shuffled by repeatedly remov- 
ing the top card and inserting it uniformly at random back into the deck. 
A coupling argument shows that the total variation mixing time for this 
Markov chain is rafogre (see, e.g., [1, 2, 12]). In fact, a detailed analysis 
yields a closed form expression for the distribution of this chain after any 
number of steps (see [3] ) . 

Here we analyze a class of walks that generalizes the top to random chain, 
namely, top to bottom-A: shuffles. These shuffles are generated by moving the 
top card uniformly at random to any of the bottom k n positions of the deck. 
For k n = n, we recover the top to random walk. For k n = 2, this is the 
Rudvalis shuffle, and upper and lower bounds of order n 3 logn have been 
shown by Hildebrand [9] and Wilson [15], respectively. 

More formally, let S n be the permutation group, and let a £ S n denote 
an element of this group, interpreting a{i) = j to mean that position i holds 
the card with label j. Fix n > k n > 1, and denote a cycle permutation by 

<7, = (1,2,...,0, 
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where <Ji{i) =i + l for 1 < i < I — 1, 07 (Z) = 1, and cr/(i) = i otherwise. Define 
the probability measure g n ,A; n on 5 n by 

/ _ J — , if a = a; for some n — k n + 1 < I < n, 
Qn,k n (crj - !;„ 

[ 0, otherwise, 

and let n be the uniform distribution on S n . Then the top to bottom- k shuffle 
driven by q nkn is nonreversible, aperiodic and irreducible with stationary 
distribution tt. 

Let u denote the bottom-fe to top shuffle. It is well known that studying 
this reversed shuffle is equivalent to studying q n ,k n (see Section 2). Then for 
the top to bottom-^ walk q n ,k n i and the reversible variant q n ,k n = \ {qn,k n + 
g* fe ), we derive bounds on the total variation and L 2 mixing times T and T%. 
Finally, we show that results for the nonreversible and reversible chains yield 
as corollaries bound on the lazy top to bottom-A; shuffle 

q n ,k n = \ {q n ,k n + S e ), 

where we put weight on the identity. 

In particular, our main results are summarized below. In these statements, 
A(c), B(c), and so on, denote positive, finite constants that may depend on 
the fixed parameter c but not on n. 

Theorem 1.1. For the top to bottom-k shuffle q n ,k n : 

(1) if k n >n— \J (ralogn)/% then 

T(S n ,q n:kn ) ~?ilogn; 

(2) if k n > cn with c € (0, 1), then 

A{c)n\ogn< T(S n ,q n , kn ) < B(c)n 2 log 2 n; 

(3) if k n < C , then 

A(C)n 3 < T(S n ,q ntkn ) < B{C)nHogn; 

(4) ifk n = 2,3, then 

An 3 logn < T(S n ,q n , kn ) < Bn 3 logn. 

For Theorem 1.1(1), the notation ~ indicates that the walk presents a 
total variation cut-off at time nlogn. See Lemma 3.2 for a precise statement 
of the result. After this paper was submitted, the author learned of the 
work of Jonasson [10], who shows that nonreversible top to bottom-fc shuffles 
have total variation mixing time T(S n ,q n & n ) ~ n 3 log n/k 2 uniformly for all 
choices of k n . 
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Theorem 1.2. Let q n ,k n = \{ln,k n + Qnk ) ^ e ^ e additive symmetriza- 
tion of the top to bottom-k shuffle. Then: 

(1) ifk n >n — C, then 

T(S n ,q n:kn ) < T 2 (S n ,q n , kn ) < B(C)n\ogn; 

(2) if k n < cn with c 6 (0, 1), then 

T 2 {S n ,q ntkn ) > T(S n ,q n>kn ) > A(c)n 2 ; 

and 

m m \ A(c)n 3 

T 2 (S n ,q n>kn ) > — -2 — logn; 

(3) /or any k n , 

T(S n ,q ntkn ) < T 2 (S n ,q njkn ) < Bn 3 \ogn. 
In particular, (2) and (3) s/iow i/iai i/ k n < C , then 

A{C)n A \ogn < T 2 (S n ,q n , kn ) < Bn 3 logn. 

The two lower bounds in Theorem 1.2(2) are complimentary in the sense 
that the first gives better estimates for k n ~ cn, while the second works best 
for k n <C cn. 

Theorem 1.3. For the lazy top to bottom-k shuffle q n ,k„ : 
(1) if k n >n — C , then 

,4nlogn < T 2 (S n ,q nikn ) < B(C)n\ogn; 



(2) if k n >n — (nlogn) /2, then 

T(S n ,q n>kn ) ~ 2nlogn; 

(3) if k n > cn with c € (0, 1), then 

A(c)nlogn < T(S n ,q n>kn ) < B(c)n 2 log 2 n\ 

(4) if k n = 2,3, then 

An 3 logn < T(S n ,q n , kn ) < Bn 3 logn; 

(5) for any k n , 

T 2 (S n ,q n<kn ) <Bn 3 logn. 
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For Theorem 1.3(2), the notation ~ indicates that the walk presents a to- 
tal variation cut-off at time 2nlogn. See Remark 5.1 for a precise statement 
of the result. Also observe that the estimates of Theorem 1.3(1) (2) bound 
the 1? mixing time T2 and the total variation mixing time T, respectively. 

As k n varies from a constant to n, these results are most satisfactory at 
the extremes of the range. For large k n the walks behave like the top to 
random chain, mixing in nlogn steps. Theorem 1.1(1) proves mixing in the 
strongest possible sense: cut-off at precisely ralogn. Let us note here that the 
precise L 2 cut-off time is not yet known even for the top to random shuffle 

Qn,rf 

For small k n , the walks behave like the Rudvalis shuffle, mixing in n 3 log?i 
steps. Theorem 1.2 proves this for the reversible chain, whereas Theorems 
1.1 and 1.3 give complete results only for k n — 2,3. 

The worst gap in these results occurs when k n ~ n/2. For these "top to 
bottom half" shuffles, [10] shows a O(nlogra) mixing time for the nonre- 
versible shuffle, and our results give an Q(n 2 ) lower bound for the reversible 
shuffle. In particular, the nonreversible and reversible top to bottom half 
shuffles mix at different rates. In this range, one difficulty in analyzing the 
reversible walk is that comparison with random transposition, one of the 
best understood models of random walk, can at best yield 0(n 3 logn) upper 
bounds (see Lemma 4.5). 

A variety of methods are used to prove the results of this paper. The upper 
bounds for the nonreversible top to bottom-A; shuffle are found by coupling 
arguments. The lower bound in Theorem 1.1(4) uses Wilson's lemma (see, 
e.g., [13, 15]). For the reversible chain, we use comparison techniques for 
walks on finite groups to prove both upper and lower bounds (see, e.g., [4]). 
Notably, comparison previously has been applied only to find upper bounds. 
It appears that this is the first application of comparison techniques to prove 
lower bounds. 

In Section 2 we introduce our notation and review basic Markov chain 
theory. Sections 3 and 4 give proofs of Theorems 1.1 and 1.2, respectively. 
Finally, Section 5 applies results from the previous sections to find bounds 
on the lazy walk q U) k n . 

2. Notation and basics. Let G be a finite group with probability measure 
q, and let {rji} be G- valued independent random variables with distribution 
q. The left-invariant walk on G driven by q is defined by Xq = e and 

Xk+i =Xk ■ rjk- 

For the top to bottom-A; measure q n k n , this definition corresponds to the in- 
formal card shuffling description given in the Introduction. See, for example, 
[12] for more details. Define convolution powers of q by 

q m (9) =q m - l *q(9) = £ q m ~ l {h)q(h~ l g) . 
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Equivalently, q m (g) can be thought of as the sum of weighted paths: 

m 

« m (s)=En*). 

where, for each fixed g, the sum is taken over m-tuples (h±, . . . ,h m ) such 
that hi ■ ■ ■ h m = g. 

If supp(q) = {g : q(g) > 0} is not contained in a proper subgroup of G or 
in a coset of a proper normal subgroup, then 

q m (g) 



\G\ 



as m 



oo. 



Our results give bounds on the rate of convergence for q n ^k n an d its variants. 
Intuitively, these results are estimates on the number of top to bottom-/c 
shuffles needed to mix a deck of cards. To make this statement more precise, 
we first need a way to measure distance between the distribution of the chain 
at time m and the stationary distribution. For probability measures q and 
7r on a finite group G, define the total variation distance as 

\\q - 7t||tv = sup \q(A) - tt(A)\ = \ ^ \q(g) - ir(g)\. 



AcG 



Alternatively, some of our results will be in terms of the L p (tt) distance 



d-K, P {q) 



IT 



Lp(tt) 



\g<£G 



i 



i/p 



Observe that \\q — 7t||tv = 5^,1(9)- Our results are for the cases p = 1, 2. 

We define the deck to be shuffled when the distance between the distribu- 
tion of the deck and the stationary distribution is small. Namely, the total 
variation mixing time is given by 

1 



2e 



1 

7T 



LP 



1 

< - 

e 



T(G, q) = inf |m| \\q m — 7t||tv < 

and the L p mixing time by 

T P (G, q) = inf |m 

With these definitions, by Jensen's inequality, 

T(G,q)=T 1 (G,q)<T 2 (G,q). 

Moreover, the functions k i— > d- KiP (q k ) are nonincreasing and sub-additive. In 
particular, for k > T p (G,q), 



d np i.g k )<e-W T '^. 
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This inequality motivates our somewhat arbitrary choice of 1/e in the defi- 
nition of mixing time. For details, see, for example, [6, 12]. 

The Markov operator Q associated to a probability measure q on G is 
given by Qf = f*q*, where q*(g) = q{g~ 1 )- The reversed random walk is 
driven by q* and has as its associated operator the adjoint of Q. That is, q* 
has associated Markov operator Q*f = f *q. 

Note that since we are on a group, the stationary measure tt is uniform, 
and furthermore, 



E 

96G 



via) 



1 



9 eG 



*(g) 



i 



Consequently, d n:P (q) = d ntP (q*), and with respect to analyzing mixing time, 
we can study either the walk or its reversal. 

For a sequence of numbers a n ,b n , we use the notation a n -< b n to indicate 
that there is a universal constant C > (independent of n) such that a n < 
Cb n . For the two-sided bound, we use a n ~ b n to indicate that there are 
constants c, C > such that ca n <b n < Ca n . For mixing times T(n), the 
notation T(n) ~ a n indicates cut-off at time a n . For a precise definition of 
cut-off, see, for example, [11]. 



3. Proof of Theorem 1.1. In this section we present upper and lower 
bounds for the mixing time of the nonreversible walk q n ,k n , using primarily 
probabilistic techniques. For k n = 3, we use the method of [15] to derive a 
lower bound. 

To prove mixing time bounds for the top to bottom-/c shuffle, we make 
extensive use of the following well-known coupling result (see, e.g., [1, 2, 12]). 

Theorem 3.1. Let q be a probability measure on a finite group G. Let 
(Xn,X%) be a coupling for the random walk driven by q with (X^) starting 
at the identity and (X%) starting from the stationary distribution tt [i.e., 
dist(X^) =vr]. Then 

\\q m -vr|| T v<IP(T>m), 

where 

T = inf{m|VA; > m, X\ = X%}. 

Furthermore, there exists a coupling such that the inequality above is an 
equality. 



We will also make use of the following coupon-collectors lemma (see, e.g., 
[I])- 
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Lemma 3.1 (Coupon-collectors lemma). Let R m be the number of dis- 
tinct cards obtained in m uniform random draws with replacement from a 
deck of n cards. That is, R m = \{C\, . . . ,C m }\ with C% i.i.d. uniform on 
{1, . . . , n}. Let Lj = mm{m\R m = n — j}, that is, the number of draws be- 
fore all but j cards have been chosen. Then for fixed j , 

L i 

——I > 1 in probability. 

re log n 

In the case of q nn , that is, the top to random shuffle, the correct mixing 
time n log n can be found using a coupling of the time reversed process q^ n- 
For this random to top shuffle, the coupling is as follows: choose a label 
uniformly at random from { 1 , . . . , n} and in each deck move the card with 
this label to the top. Clearly, this is a coupling, and the coupling time is 
given by the coupon-collectors lemma (for details, see, e.g., [1]). The proof 
of Lemma 3.2 is by a similar coupling. 

Lemma 3.2. For k n >n — J ^relogre, the walk (S n , q n ,k n ) presents a total 
variation cut-off at t n = re log n. That is, for (0,1), 



and 



,. li (l+e)nlogn ,, _ 

hm \\q y , 6 - 7r TV = 



lim \\J l -^ nl ° sn _ vrll - 1 



Proof. Since diy{p (jl \u) = d^\{p* ( n ',u), we can consider the reversed 
random walk u ■ For this reversed walk, we define a coupling (X^^X™), 
where X\ starts from the identity and X2 is drawn from the stationary 
distribution. Let 

Af = {X™(i)\n-k n <i<n}, i = 1,2. 

That is, A™ is the set of cards that at time m are in the bottom k n positions 
of deck j. At time m, in the first deck pick a card a a uniformly at random 
from A™ and move it to the top of the deck. If o~ a E A™ , then move card a a 
in the second deck to the top. If not, then in the second deck uniformly at 
random pick a card from A™ \ A™ and move it to the top. 

Clearly, deck one is driven by q* kn . For the second deck, note that any 
card in A™ fl j4™ is chosen if and only if it is chosen in the first deck, 
and hence, with probability l/k n . And cards in A™ \ A™ are chosen with 
probability 

k n -\AfnA^\ 1 _ 1 

h h — \ A m n A m \ ~ h ' 

h, n h, n I Si^ I I I ri"n 
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So this is, in fact, a coupling. Define 

t = M{m\X^{i) = X?(i) for 1 < i < n - k n }. 

That is, ro is the first time the top n — k n cards are matched in both decks. 
Then for m > To, A™ = A™, that is, the set of cards in the bottom k n 
positions are the same in each deck. Consequently, after time ro, new matches 
are not broken and every time an unmatched card is chosen, a new match 
is made. 

First we estimate tq. Let L be the probability that, starting with all cards 
unmatched, n — k n consecutive matches are made. Then, 



i x 7(1/2)71 log n 

> I 1 



01/7(1/2) logn-L 
1 

Furthermore, by the Markov property, for fixed e > 0, 



P(t~q > sn log n) < P ( tq > E\J n log n ■ y — n log n 



1 



< 



n— >oo g 



7^/7(1/2) log n-l) 



\ 7(l/2)nlogn 



r 7 n i°g n 



Let ri be the time it takes after ro for each card in A\° = A2 to be selected. 
That is, 

n = inf{m|m > 0, each card in A^° has been selected by time m + ro}. 
By the coupon-collectors lemma, for fixed e > 0, 

lim P(T 1 >(l + s)k n \ogk n ) = 0. 

n— >oo 

Finally, if T is the coupling time, then since 
P(T > (1 +e)nlogn) < P^r > |nlogn^ + p(t\ > ( 1 + 0nlogn 

n— >oo q 

by Theorem 3.1, 

lim ||<£+ e)Blo8n -7r|| TV = 0. 
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The lower bound argument is analogous to that of the top to random shuffle 
(see, e.g., [1]). Let Bj be the set of permutations a for which the bottom j 
cards have increasing labels. That is, 

a(n-j + 1) < cr(n-j + 2) <■■■ < a{n). 

Then ir{Bj) = ij. Starting from the identity, let Lj be the number of shuffles 

until all but j of the cards with labels in {re — k n + 1, . . . , n} have been chosen. 
Then, if Lj > to, the bottom j cards after to bottom k n to top shuffles are 
in increasing order. So for fixed e > 0, there is an e' > such that 

lliX £)nlOS " - tIItv ^ ,£m > ( X - £ W°g™) " j 

> lim P(L 3 - > (1 - e')k n \ogk n ) - - 

n— >oo J- 

since 

A; n log/s n 
hm — = 1. 

n— *oo n log ?7, 

Using the coupon-collectors lemma, the result follows. □ 

Lemmas 3.3 and 3.4 below bound the mixing time of q n ,k n hi the cases 
where k n is relatively large and when k n is small. Both lemmas rely on the 
following coupling. 

We construct a coupling (X™,X™) where X\ starts from the identity 
and X2 is drawn from the uniform distribution. Recall that the notation 
X™(i) =j can be interpreted to mean that at time to position i in deck s 
holds the card with label j. Let 

A™ = {X™(i)\n-k n + 2<i<n}, a = 1,2. 

Note that A^ 1 is not the set of cards in the bottom k n positions (to which 
the top card can be sent), but rather only the cards in the bottom k n — 1 
positions. 

We define a coupling as follows: first pick one of the two decks with equal 
probability. Say we picked deck one. Then X\ proceeds as usual by uniformly 
at random moving the top card to one of the bottom k n positions; X2 mimics 
the moves of X\ except in a couple of cases. If Xj n (l) € ^4™ (i- e -> top 
card in the first deck is in A™), and the first deck moves the top card 
to position (X™)~ 1 (Xf l (l)), then the second deck moves the top card to 
(X^)- 1 ^ (1)) - 1. And, if Xf(l) G Af and the first deck moves the top 
card to (X™)" 1 (X™ (1)) — 1, then the second deck moves the top card to 
(X™) -1 (X™ (1)) . We have an analogous description if we originally picked 
deck two. Accordingly, if card i is on the top of one deck and in the bottom 
k n — l positions of the other deck, then, with probability l/k n , it will couple 
on the next move. Furthermore, matches between the decks are never broken. 
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Lemma 3.3. For c 6 (0, 1) and k n > ere, i/iere exist constants A{c) such 
that the total variation mixing time for the walk driven by q n ,k„ satisfies 



Proof. We use the coupling described above. Let Tj be the first time 
that the cards with label j couple in the two decks. That is, 



We estimate r ? - by first showing that, starting from any permutation of the 
decks, any card j has probability at least ^ to couple within 3nlogn steps. 
Let be the first time card j reaches the top of deck one, starting from 
state a. And let r CT be the first time the bottom card reaches position n — k n . 
Then for n sufficiently large, 



The second inequality follows from the fact that T a is the sum of independent 
geometric waiting times with means k n , k n /2, . . . , k n /k n , and, consequently, 
is equivalent to the coupon collectors problem. In particular, the above shows 
that, starting from any state, there is positive probability independent of n 
and k n that card j reaches the top of the first deck in 2relogre steps. 

When card j gets to the top of the first deck, we are in one of three 
situations: card j is already coupled, card j in the second deck is in the 
bottom k n — 1 positions, or card j in the second deck is in the top n — k n + 1 
positions. In the first two situations, card j will be coupled at the next step 
with probability at least l/k n (if j is already coupled, it will remain coupled 
at the next step). So we only need to consider the third situation. Assume 
card j moves to one of the bottom |~-BA; n ] positions for some B £ (0, 1) 
(which happens with probability at least B). Let tb be the first time j 
leaves the bottom k n — 1 positions. Then tb is the sum of independent 
geometric waiting times, and depends on the exact position in the bottom 
\Bk n ~\ to which card j moves. However, by construction, we have the lower 
bound 



T(S n ,q n>kn ) < An 2 log 2 n. 



T 3 =mf{m\(X?y 1 (j) = (X?y 1 (j)}. 



P{tI > 2nlogn) < P(r CT > 2nlogn — (n — k n )) 




1 

2' 



"■71 

Et b > ]T 



h n 1 



k 



1 1 



r=\Bk n ] 



r 



> K log 



1 



B + 1/kn ' 
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And, 

r=l 



<2kl. 



ii ' 



By Chebyshev's inequality, 



pfr B >^>l- 4Var(TB) 
3 2 J" 1 (Er B y 



>1 



log 2 i/(s + i/fc n ; 

Consequently, if we choose S and such that 

1 /2(l-c) 

log ; — > max , 3 

& B + l/K ~ V c 

where c is from the statement of the lemma, then there exists 5 > (inde- 
pendent of n) such that, for k n > K, 

P{r B >n-k n )> P{t b > Et b /2) > S. 

For instance, we can choose 5 = 1/9. But if t b > n — k n , then j will still be 
in the bottom k n — 1 positions of deck one when j reaches the top of deck 
two. Consequently, for each of the original three cases, after reaching the 
top of deck one, card j couples within the next n — k n steps with probability 
at least 5/n. Combining this with the bound on r^, for the coupling time Tj 
of card j, we have 

P(n < 3nlogn) > — . 
Moreover, by the Markov property, 

/ g \ Anlogn/3 

POrj > An 2 log 2 n) < f 1 

\ 2n 



< exp - 



5 A log n 



6 

Finally, if T is the coupling time for the two decks, then 

r)/rp • i 2i 2 \ ^ ( SAlogn 

Pll > An log n)<nexp 

V 6 

and the result follows by taking A sufficiently large. □ 
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Remark 3.1. Using the lower bound argument of Lemma 3.2, we can 
show that, for c £ (0, 1), k n > cn, there exist constants B(c) such that the 
mixing time satisfies 

T(S n ,q ntkn ) >B(c)n\ogn. 

The following lemma gives an upper bound on the mixing time for the 
walk driven by q nj k„ with k n < C. The coupling used to prove the result 
is the same as in Lemma 3.3, however, we analyze the coupling time by a 
different technique. 

Lemma 3.4. For k n < C , there exist constants A(C) such that the total 
variation mixing time for the walk driven by q n k n satisfies 

T(S n) q n>kn ) < An 3 log n. 

Proof. Using the coupling described above, we show that, starting from 
any permutation of the decks, any card i has probability at least 5 > 
(independent of n) to couple within n 3 steps. Fix card i and let r be the 
first time that card i is on the top of one deck and in the bottom k n — 1 
positions of the other. Then at the next step, the cards have probability 
l/k n to couple. Let 

t) =inf{t|Xj(l) =i}, 

T™ = mf{t>T™- l \Xt(l)=i}. 

That is, r™ is the time when card i is on top of deck j for the mth time. 
Without loss of generality, assume that t\ < r\. If t™ < r, then 

Tf <r 2 m <7f + n-k n . 

And if r]" +1 < r, then 

7f +1 <rf + 2(n-k n ). 

Define the random variables d™ = — (X™) -1 (i)] mod n, which 

give the oriented distance between the positions of the ith card in each deck. 
Note that d™ only changes when i is in the bottom k n — 1 positions in at 
least one deck. Let r* = inf{t > T±\X\(i) = n — k n + 1}. Then define as 
i.i.d. random variables with distribution given by 

P(Yl=t) d ^P(T*-rl = t). 

That is, gives the amount of time it takes a card to get from the top of 
the deck to the n — k n + 1 position. Furthermore, before r, the distribution of 
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the change in distance is given by cL 1 



T m Hi<it 
^ 1 u i£ L ym 



Y™ . Consequently, 



P{r[ 



m+l 



<t)<P 



< n 



Y2 ), and note that a < oo since Y"^ can be realized as a 



Let a 2 = Var(y/ 

finite sum of geometric waiting times. Then by the central limit theorem, by 
taking m = n 2 , we have that -PfVj 1 +1 < r) < 1 — e independent of n. That 
is, > e. Furthermore, since t\ < n with positive probability 

independent of n, P(t < 3n 3 ) > e. Consequently, there is a 5 > such that 
if Tj is the coupling time for card i, then P{ji < 3n 3 ) > 5. Finally, if T is the 
coupling time for the two decks, then 

P(T > An 3 log n) < nP(ri > An 3 logn) 

< n(l - 5) Alosn/3 

A— *oo q 



By taking A sufficiently large, the result follows from Theorem 3.1. □ 



Remark 3.2. For k n < C, the walk performed by one card under the 
measure q n .k n is an example of a class of walks known as necklace chains. 
By results in [14], this immediately yields the lower bound 

B{C)n 3 <T(S n ,q n>kn ). 

In [9], the Rudvalis shuffle q ni 2 is shown to have an upper bound of order 
0(n 3 logn). In [15], a matching lower bound for this shuffle is given by using 
Theorem 3.2. Here we show that the method of [15] can also used to lower 
bound q n $. 

Given a chain Xt, we say that the chain (Xt,Yt) is a lifting of the original 
chain if the marginal distribution of Xt is the same as the distribution of 
X t . 

Theorem 3.2. Suppose that a Markov chains Xt has a lifting (Xt,Yt), 
and that \& is an eigenfunction of the lifted Markov chain: (Xt+i,Yt+i)\(Xt, 
Yt)] = X^(Xt,Yt) . Suppose that \^f(x,y)\ is a function of x alone, |A| < 1, 
3?(A) > 1/2, and that we have an upper bound R on E[\^f (Xt-\.\ ,Yt+\) — 
ty(Xt,Yt)\ 2 \(Xt,Yt)]. Let 7 = 1 — 3?(A). Then when the number of steps t is 
bound by 

t< logqW + (l/2)log 7 e/(4fl) 
- log(l - 7) 

the variation distance satisfies \\Xt — TrW^y > 1 — e . 
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For a discussion of Theorem 3.2, see [13, 15, 16]. 

Lemma 3.5. For e > 0, there exist constants C(e),N > such that, for 
n>N, 



Jn,3 



71"||TV > 1 — £ 



for m < Cn 3 log n. 



{x; 


\j),Z t (j) + l), 


m 


= cr n - 


-i,X t 


Hj) 




n 


or 




\j)-l,Z t (j)), 


m 


= On- 


-2,X t 


\j) 


> 


n 


-1, 




m 


= Vn 


or 


\j) 












m 


= o- n . 


-i,X t 


< 


n 


— 1 or 






Vt 


= Cn- 


-2,Xt 


Hj) 


< 


n 


-2, 


(n - 


■l,Z t (j)-l), 


Vt 


= o- n . 


-u x t 


Hj) 




1. 




(n- 


•2,Z t (i)-2), 


m 


= cr n - 


-2,Xt 


\j) 




1. 





Proof. Let X^ l {j) = f denote that the card with label j is at position 
f at time t. First we lift the chain to (Xj~ , Y t ) = (Xf 1 ,t mod n). Let Z t (j) = 
(X^ 1 (j) - Xq 1 {j) + Y t {j)) mod n and let rj(t) £ {<7„_ 2 , cr n _i, a n ] denote the 
cycle that is chosen at time t. Then, 



(X-\(j),Z t+1 (j)) 



Define v(x) to be the xth number in the list 

A n " 3 ,...,A,l,xi,Xo 

and define the functions 

^j(X t -\Y t ) = t;(^- 1 (i))exp(^(i)27rVn), 

n 

^{Xt\Y t ) = Y J * ] {Xt\Y t ). 

3=1 

Now we will find values for \,XiiXo that make (and, hence, ^f) an 
eigenf unction. Also note that \ty(Xf ,yi)\ = \^(XjT ,y2)\ for all y\,y2- If 
2>Xf 1 (j) >n-2, then 

V j (X^ 1 ,Y t+1 ) = \y i (Xr 1 ,Y t ). 

Let w = e 2m / n . By looking at what happens when X^~ l (j) = 1, X t _1 (j) = n, 
and = n — 1, we find that ^ j is an eigenfunction with eigenvalue A 

when the equations 

Xo + Xiw- 1 +w- 2 = 3X n ~ 2 , 
— + 2w = 3\, 



Xo 



+ w = 3\ 
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(3A-w)(3A-2w;)' 
2 



are satisfied. In particular, 

Xo 

Xl = 7TT 

6X — w 

and A is a root of the polynomial 

/(A) = 9A n - 9wX n ^ + 2wX n ~ 2 - 3uT 2 A 2 + w^X. 

We will use Newton's method to approximate a root of /(A) starting with 
z = 1 and = z fc - f(z k )/f'(z k ). By Taylor's theorem, 

< \ max |r(p«fc + (1 -P)*fc+i)l • 2 - 
2 o< P <i J \z k ) 

Furthermore, since 

/'(A) = OnA™" 1 - 9(n - l)wA n ~ 2 + 2w(n - 2)A"" 3 - 6uT 2 A + uT 1 , 

f"{X) = 9n(n - l)X n ~ 2 - 9(n - l)(n - 2)wX n - 3 

+ 2w(n - 2)(n - 3)A n " 4 - 6w~ 2 

if z = 1 + 0(l/ra 2 ), then /'(*) = 2n + 0(1) and /"(*) = 2n + 0(n). So if 
Zk = 1 + 0(l/n 2 ) and z fe+ i = 1 + 0(l/n 2 ), then 

lx/ m ^ 1 + 0{l/n) , ,/ m2 

l/(*fc+l)l< — ^-^1/(^)1 ■ 

Furthermore, 

/(z ) = 9 _ 7 W + it," 1 _ 3 W ~ 2 

36vr 2 .4vr 4 
= — 7. i h 0(1 n ). 

Consequently, by induction, 

2 k 



\f(z k )\<i(^j +o( 



7 2 fc +2 



\z k+ i-z k \ = - - +0 

n\nj \n z 

So for n sufficiently large, the sequence {z k } converges to a point z^ and 
by continuity, f{z (yo ) = 0. Furthermore, since 

f'(z ) = 9n - 9(n - l)io + 2w(n - 2) - 6w~ 2 + uT 1 

= 2n-il4ir + 0(l/n), 
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Re(zi] 



1 - Re 



/(*>) 

/'(«>; 



Re(/(z )) Re(/'(«o)) + Im(/(«o)) M/'(*b)) 



18vr 2 + 14tt 



??,- 



l/'(^o)l 2 

+ 0(l/n 4 ). 



Finally, since 



2vr 2 



+ 0(l/n 4 



there exist C2 > c\ > such that 



1 - + 0(l/n 4 ) > Re(zoo) > 1 - % + 0(l/n 4 ). 



With A 



Zoo, Xo = 1 + 0(1/ n), an d Xi = 1 + 0(1/ ri). Consequently, 
*W = n + 0(l/n). 
Now we estimate R. Since |A — 1| = 0(l/n 2 ), 

vf i (xr + \,it + i)-^(x t - 1 ,y t ) 



w 



Zt{i) 



(A-l)A x * 1 ( i )=0(l/n 2 ), 


2<Xt 


X W< 


n-2, 


X0-A"- 3 = O(l/n), 


M*) = 


i, 


Vt = 


- fn, 


Xi^" 1 -A n " 3 = 0(l/n), 


M*) = 


i, 


Vt = 


- Vn-1, 


w -2_ \n-3 = 0(l/n), 


Mi) = 


i. 


Vt = 


- 0"n-2j 


Xi-Xo = 0(l/n), 


Mi) = 


n 


Vt = 


= cr n , 


wXo-Xo = 0(l/n), 


Mi) = 


n 


Vt = 


= &n-l, 


wxo -Xo = 0(l/n), 


Mi) = 


n 


Vt = 


= Cn-2) 


l-Xi = 0(l/n), 


Mi) = 


n 


- 1 


Vt = &n, 


l-Xi = 0(l/n), 


Mi) = 


n 


- 1 


Vt = 0"n-l 


wXi -Xi =0{l/n), 


Mi) = 


n 


- 1 


Vt = On-2 



Consequently, 

\*(Xr + \,Y t+1 ) - *(Xf\Y t )\ = 0(l/n) 
and we can take itl = 0(l/n 2 ). The result follows by Theorem 3.2. □ 

4. Proof of Theorem 1.2. In this section we focus on the reversible walk 
\(ln,k n +Qn k )■ F° r reversible chains, path comparison is a useful technique 
for studying rates of convergence (see, e.g., [4, 5, 6, 8]). In particular, many 
of the arguments in this section rely on the notion of a flow to compare 
top to bottom-/c shuffles with the well-studied random transposition walk. 
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Together with estimates on the least eigenvalue, this approach yields L 2 
mixing time bounds. 

To begin, consider a symmetric probability measure gona finite group G 
and fix a symmetric set S that generates G and such that q(s) > for s £ S. 
Define paths in the Cayley graph (G, S) to be sequences 5 = (e, 2/1,2/2, • ■ • , Uk), 
where e is the group identity and y^ l yi + \ G S. Given such a path, define its 
length to be \8\ = k, and for each s £ S, let 

N(s,6) = \{ie{0,...,k-l}\ y r 1 y i+1 = s}\. 

That is, N(s,5) is the number of times the generator s is used in the path 
8. Furthermore, let ds(x,y) denote the graph distance on (G,S) between x 
and y. 

Definition 4.1. Fix two symmetric probability measures g,gona finite 
group G and a symmetric set generating S C supp(g). A (q, g)-flow is a 
nonnegative function r] on the set of all paths V in the Cayley graph (G, S) 
such that 

ri(8)=q(y), 

sev y 

where V y is the set of all paths from the group identity e to y contained in 
V. 

4.1. The least eigenvalue. This section presents a lower bound on the 
smallest eigenvalue of the chain \{q n ,k n +1n,k n )- The proof relies on a geo- 
metric result that bounds the eigenvalues of symmetric chains by consider- 
ing loops at the identity of odd length. (See [6] for details.) Together with 
comparison, Lemma 4.1 will be used to derive estimates on mixing time in 
Section 4.2. 

The following definition of an odd flow is analogous to that of a flow, but 
restricted to paths of odd length. 

Definition 4.2. Fix two symmetric probability measures g,gona finite 
group G and a symmetric set S C supp(g). A (q, q)-odd flow is a nonnegative 
function r\ on the set of paths of odd length O in the Cayley graph (G, S) 
such that 

SeOy 

where O y is the set of all paths of odd length from the group identity e to 
y contained in O. 
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Note that we are not assuming that S generates G, that is, the Cayley 
graph (G, S) need not be connected. However, the existence of a (q, q)-odd 
flow implies that, for each y with q{y) > 0, there is at least one path from e 
to y in O. 

Theorem 4.1 ([6]). Fix two symmetric probability measures q,q on a 
group G and a symmetric set S C supp(g). For any (q,q)-odd flow r\, 

a > _1 i \ ^ min 
A{rj) 

where j3 m \ a and /3 m i n are the smallest eigenvalues of q and q respectively, 
and 

Airy) = max — !— V \5\N(s, 5)n(S). 
scs q(s)^ 1 

It is well known that a chain q is aperiodic if and only if the least eigen- 
value satisfies /3 m i n = — 1. As a trivial application of Theorem 4.1, by taking 
S = {e} and q{e) = 1, we have (3 m \ a > — 1 + 2q(e). When our chain puts no 
weight on the identity, the above result provides a way to capture more 
subtle effects of aperiodicity on the least eigenvalue. 

Lemma 4.1. Let /3 m i n be the smallest eigenvalue of the symmetric chain 

Qn,k n = Uln,k n + Qn,kJ- Then 

a \ 1 i k n — 1 

Pmin > -1 + 



k n (n-k n + 2)(n+l) 



Proof. We will apply Theorem 4.1 with q(e) = 1 and q(g) = other- 
wise. In this case, f3 m i n = 1. Let 5 = supp(q nj fc n ). For / odd and such that 
n — k n + 1 < I < n, define paths 

<5| fcl = (e,a,,a I 2 ,...,oi) ±1 

and set O = {S t 1 \l odd, re — k n + 1 < I < n}. Let 



< 



2^n-fc„+Km< n 1/m 2 l^ 1 ] 2 

m odd 
n+1 1 X 

n-k„+2 X 2 I 2 

(n-fc n + 2)(re + l) 1 

k n 1 I 2 
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and rj(6) = otherwise. Then, 

.. 2k n {n-k n + 2){n + l) N(s,5) 

m s — —i — sgir 

_ 2k n {n-k n + 2)(n + l) 
k n — 1 

The result follows from Theorem 4.1. □ 

Theorem 4.1 gives the best results when we can use short paths. In the 
case of \{q n .k n + Qn fc„)' f° r paths 5 with \5\ < [ n "^ fc " J , the card originally in 
position [ n ~ fcn j -|- 1 moves distance ±1 at each step along the path. Con- 
sequently, the shortest loops at the identity with odd length have length 
~ n k n . 

4.2. Bounds on mixing times. The following lemma gives a lower bound 
on the mixing time of \{q n ,k n + Qnk„) f° r sufficiently small by looking at 
the motion of an individual particle. 

Lemma 4.2. Let q Utkn = \{q n ,k n + q* n , kn ) with k n <cn, < c < 1. Then 
there is a constant N(c) such that, for n> N , and I < c ^ 1 ~^n " > 

l c 

\\<ln,k n ~ ^ll-TV - 2' 

In particular, there is a constant A(c) such that the total variation mixing 
time satisfies 

T(S n ,q nik J > An 2 . 

PROOF. Note that the card originally in position |^ ( 1 ~ 6 ) n j _|_ \ performs a 
simple random walk on {1, . . . , [(1 — c)nj} before hitting any of the bottom 
[cn\ positions. Call this card a and define the event 

A = {o~\n — [cn\ < cr~ 1 (a) < n}, 
that is, a is in the bottom \cn\ positions. Then tt(A) > c— 1/n. For / = 
y c{i-c) n j \ e i Xx, . . . ,Xibe an i.i.d. random variable with P(Xi = ±1) = |, 
and let Sj=J2{Xi- Then 

'1 — c)n~ 



qUM)< P 



max I Si I > 
i<j<r 2 



41 

(1 ^) 2 ra 2 



< ^r - ^ (by Kolmogorov's maximal inequality) 
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Since \\q l n kn — 7t||tv = maxAcS 1 ™ \<t n k (A) — vr (^4) | , the result follows by tak- 
ing n sufficiently large. The mixing time bound follows from the fact that, 
for n sufficiently large, 

c<2||^ fen -7r|| TV < e -LW„,<r n , fe Jj_ 

In particular, 

T(S n ,q n ,k n )> 1 _ logc - □ 

Now we will derive an upper bound on the mixing time of \{q n ,k n + Qn k n ) 
with n — k n <C independent of n. That is, the symmetric version of the walk 
that moves the top card uniformly at random to any but a finite number of 
the top positions. The proof is by comparison and is based on the following 
two results. For proofs of these results, see, for example, [4, 5, 6]. 

Definition 4.3. Given a finite group G and a symmetric probability 
measure q, define the Dirichlet form 

2 \ G \ x,y 

Note that £ q (f, f) = ({I — Q)f, f)L 2 (n)i where Qf h-> / * q is the Markov 
operator associated to q. 

Proposition 4.1. Assume that £ < AS. Then, 

T(G, q) < T 2 (G, q) ± max {^(G, g), Alog \G\, _ J j, 

where f3- = max{0, — /3 m in}- 

Theorem 4.2. For any (q,q)-flow, £ < A(n)£ with 

A(r,)=max^. , £\S\N(8,S)T,(S). 
se5 ?( s ) s&> 

The proofs of the following two mixing time bounds are by comparison 
with the random transposition measure on S n \ 

( l/n, if g = e, 
QRT,n(g) = \ 2/?i 2 , if g=(i,j),i^j, 
{ 0, otherwise. 

Lemma 4.3. For k n >n — C, there exist constants B(C) such that 

T 2 (S n ,q nt k n ) <Bnlogn. 



s i,3 
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Proof. Let S = {of 1 - n — k n + 1 < / < n}. First we define paths 5ij, 
1 < * < j <n from e to (i, j) in the Cayley graph (S n , S): 

o^OjoJ^ai, C + 1 < i < j < n, 

(a^f-^acl^j+c^+ivjlc-ivc+i^-^ 1 , l<i<C, 

i < j < n — C , 

j >n-C. 
Define a (qRT,mQn,k„ 

) flow by rj{5ij) = 4y. For i < C, \S itj \ < 2(C - 2). And 
each s S S is used in at most n paths 5ij with i> C. Consequently, 

A(i])<8[C(C + 2) 2 + l}. 

Since T(S n ,q^T) ~ flogn (see [7] for details), the result follows by applying 
Lemma 4.1 and Proposition 4.1 together with Theorem 4.2. □ 

The following lemma bounds the mixing time of q n ,k n = \((ln,k n + Qnk n ) 
for arbitrary k n . The proof is by comparison with the random transposition 
measure, but while the flow defined in Lemma 4.3 used only one path for 
each transposition, here, for most transpositions, we define k — 1 paths. 

Lemma 4.4. There exists a constant A such that the mixing time satis- 
fies 

T 2 (S n ,q n:kn ) < An 3 logn. 

Proof. Let S = {erf 1 : n — k n + 1 < / < n}. The proof is by comparison 
with the random transposition measure qKT- For j > i > n — k n , define the 
path 

For i < n — k n , we define k n — 1 distinct paths 5\ ■ with n — k n < I < n. For 
j > I, let 



6 i,j = ( a i ) ki a i 



and for i <j <l, let 



So < 2n + 12 < 3n. Define a (<? RT , <? n , fc J-flow by i](5ij) = ^ and r](8lj] 
VZ=W Then ' 

<l 8n 2 + ^n. 
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Since T(S n , c/rt) ~ § logro (see [7] for details), the result follows by applying 
Lemma 4.1, Proposition 4.1 and Theorem 4.2. □ 

The following lemma shows the difficulty in applying path comparison via 
Theorem 4.2 to bound mixing time. 

Lemma 4.5. Consider a (q,q)-flow rj on (G,S). For 

^)= ma x-i-^|«5|iV( S) %(«5), 
q{s) S( _ p 

we have the lower bound 

A( n )>Y,d 2 s {e,g)q{g). 

In particular, for X C G, A(rj) > d 2 s (e,X)q(X). 

Proof. By averaging over s, 

^7)>£|<Ws,<W) 

s,S 

>J24(e,g)q(g). 

9 ^ 

Observe that we can always choose a (q, <7)-flow n such that 
A{rj) < (max— ^) ^ d 2 s {e,g)q{g). 

VS£6 q ^ S > J g€G 

Lemma 4.5 shows that the upper bounds on mixing time that we derive 
in this section are the best one can do using comparison with the random 
transposition walk. 

Consider a symmetrized variant of the Rudvalis shuffle driven by the 
measure r n which is uniform on the generating set {cn,^ 1 , (l,n),id}. This 
walk was analyzed in [15] and an 0(n 3 log n) lower bound was derived for the 
total variation mixing time (see, e.g., [12] for a matching upper bound). Here 
we use comparison to extend this result to lower bounds for symmetrized 
top to bottom-A; walks. 

Lemma 4.6. For < c < 1 and k n < cn, there exists a constant C > 
such that the I? mixing time satisfies 

Cn 3 

T 2 {S n ,q n ,k n ) > ^3-logn. 
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PROOF. Let S = {a^; 1 ^}, where r = (l,n) and observe that 

\ n ~ l „n-l 
(Tl = (T n - (<T n _J -<J n 

f~-l ^\ n ~ l ^n-l 

For n — k n < I < n, define paths 6 ±1 in the Cayley graph (S n , S) as above, 
and a corresponding simple (q n ,k n , r n )-fiow 77. Then 

4 



n n-k n <l<n 



A £ [3(n-/) + l] 2 

n n-k n <l<n 



<Bk 2 n 



for some universal constant .B. By Theorem 4.2, £g n fcn < Bk^£ rn . By Propo- 
sition 4.1, together with the lower bound on the mixing time for q' n given 
in [15], we have 

n 3 log n ■< max | AT 2 (G, q) , A log | G\ , _ 1q 1 | . 

By Lemma 4.1, — l/log/3_ = 0(n 2 ), and so either ^4T2(G, q) or Alog|G| is 
bounded below by n 3 logn. By Lemma 4.2, n 2 ^ T 2 (G, q), and so AT^G, > 
Alog|G|. Consequently, for n sufficiently large, 

n 3 logn < AT 2 (G,q) 

and the result follows. □ 



5. Proof of Theorem 1.3. We show that our estimates on the mixing 
times for q n j. n and q n ^ n yield bounds for the lazy top to bottom-/c shuffles. 
In order to transfer mixing time results for the reversible walk q n< k n to the 
present case of 

q n ,k n = \ {ln,k n + <5e)j 

we recall the following result. 



Proposition 5.1 ([6]). Let q be a probability measure on G and set 
q* = q*q* . Then 

T(G,q)<T 2 (G,q)<2T 2 (G,q*). 

More generally, if q v = q v *q* v , then T 2 (G,q) < 2vT 2 (G,q v ). Finally, q* v -kq v 
can be used instead of q v . 
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Lemma 5.1. For k n >n — C , there exist constants B(C) such that 

T2{S n An,kn) < Bnlogn. 
For arbitrary k n , there is a constant A such that 

T 2 (S n ,q n:kn ) < An 3 logn. 

Proof. By Proposition 5.1, it is sufficient to prove the bounds for 

Pn,k n = Qn,k n * 1n,k n - 

Observe that 

Pn,kn = Uln,k n + S e) * Uln,kn + S e) 

= 2^n,k n + \{Qn,k n *Qn,k n + S e )] 

•> In 

— 2Hn,k n - 

Consequently, £q nkn (/, /) < 2£ Pnkn (/, /). Note that p ri)kn is a positive oper- 
ator and, consequently, has nonnegative eigenvalues. The result then follows 
from Proposition 4.1, together with the L mixing time bounds for q n k 
derived in Section 4.2. □ 

To transfer total variation mixing time results for q n ,k n to the lazy top to 
bottom-^ shuffle, we make the following elementary observation. 

Definition 5.1. Let q drive a walk on G. Then for p G (0, 1), the asso- 
ciated p-lazy walk is driven by measure 

q P =pq+(l-p)5 e . 

Lemma 5.2. Let q drive a walk on G with stationary distribution ir, and 
fixp,e(z (0,1). Then there exists a constant C(p,e) such that mixing times 
for q and the associated p-lazy walk q p satisfy 



T(G, q p ) < max 



' £ T(G,q),C 



P 

Specifically, we can take C = 80/ \pe 2 ). 

Proof. Let S m be a binomial(m,p) random variable. Then 



\l P -tt||tv 

sec 



W\q™{g)-*{ 9 )\ 



Y^P{Sm = k)(q k (g)-K(g)) 
k 
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<J2 P ( S m = k)-\\q k -Tr\Wv 

k 

<P(S m <2T(G,q)) + ^. 
Taking fh > 2 ^T(G,q), by Chebyshev's inequality, 

P(S m < 2T(G,q)) < phSn-ESnl >(l- 2^) ES ™) 
< 1-P 



mp(l-2/(2 + e)) 2 ' 
And consequently, 



Q™ - ttIItv < -r—r-, — n , H 



mp(l - 2/(2 + e)) 2 2e 2 
1 

< — 

~ 2e 

for m > 80/(pe 2 ). □ 

Now we can transfer the mixing time results for (/ n ,fc n to q n ,k„ ■ 

Corollary 5.1. For k n >n — \J (nlogn)/2, there exists a constant C 
such that 

T(S n ,q n! k n ) < Cnlogn. 
For c G (0, 1) and k n > cn ; £/iere exist constants A{c) such that 

T(S n ,q nkn ) < An 2 log 2 n. 



Remark 5.1. For k n >n — (nlogn)/2, instead of using Lemma 5.2, 
we can adapt the coupling of Lemma 3.2 to show T(S n ,q n ^ n ) ~ 2nlogn. 
The coupling (Xf 1 , X™) of q n ,k n yields the coupling 

{x^,x^) = (xf m ,xl m ) 

of q n ,k n > where S m is an independent binomial(l/2, m) random variable. 
Then, if T is the coupling time for (Xf ,XJj* ), 

P(X^ + X?) < p(s m < (l + 0nlogn) + p(t > (l + 0nlogn 

For m = 2(1 + e)nlogn, the first term goes to by Chebyshev's inequality, 
and the second term goes to by the cut-off shown in Lemma 3.2. 
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The lower bound is also analogous to that given in Lemma 3.2, where we 
now make the observation that 

P{Lj>m)>p(Lj> (l-^jnlogn) -p(s m < fl-|jnlogn\ 



So for k n >n — ^ (ralogn)/2 and e G (0, 1), 



and 



Km ||n (1 " e)2nlogn -ttII -1 



Km ll(7 (1+e)2nlogn - vrll -0 
^H^fen ^IItv- u - 



Finally, transferring the lower bounds for k n = 2,3, which were derived 
using Wilson's lemma, also requires only a simple argument. Let {r/j} be 
i.i.d. Bernoulli random variables with p = 1/2, and let Nt = J2i=iVi- Then 
if Xt is the top to bottom-A: process, the lazy top to bottom-A; process is 
given by X t = X^ t ■ Using the notation of Theorem 3.2, if (X t ,Y t ) is a lifting 
of Xt, then {Xt,Yt) = (XN t ,YN t ) is a lifting of Xt. It is not hard to check 
that the assumptions of the theorem are met with $ = ^, A = 1/2 + 1/2A, 
R = R/2, and 7 = 7/2. Then 

lQg^max+ (l/2)log7£/(4fl) _ log ^ max /2 + (1/2) log 7g/(4g) 
-log(l- 7 ) -log(l- 7 /2) 

Using the estimates in Lemma 3.5 and [15], we have the following lower 
bounds. 



Corollary 5.2. For k n = 2,3 and e > 0, there exist constants C{e), 
N > such that, for n> N , the lazy top to bottom-k shuffle satisfies 

for m<Cn 5 log n 
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